NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

Dalal, Murtaza; Pathak, Deepak; Salakhutdinov, Ruslan (December 2021, Advances in neural information processing systems)

Despite the potential of reinforcement learning (RL) for building general-purpose robotic systems, training RL agents to solve robotics tasks still remains challenging due to the difficulty of exploration in purely continuous action spaces. Addressing this problem is an active area of research with the majority of focus on improving RL methods via better optimization or more efficient exploration. An alternate but important component to consider improving is the interface of the RL algorithm with the robot. In this work, we manually specify a library of robot action primitives (RAPS), parameterized with arguments that are learned by an RL policy. These parameterized primitives are expressive, simple to implement, enable efficient exploration and can be transferred across robots, tasks and environments. We perform a thorough empirical study across challenging tasks in three distinct domains with image input and a sparse terminal reward. We find that our simple change to the action interface substantially improves both the learning efficiency and task performance irrespective of the underlying RL algorithm, significantly outperforming prior methods which learn skills from offline expert data. Code and videos at https://mihdalal.github.io/raps/
more » « less
Full Text Available
Don’t Copy the Teacher: Data and Model Challenges in Embodied Dialogue

https://doi.org/10.18653/v1/2022.emnlp-main.635

Min, So Yeon; Zhu, Hao; Salakhutdinov, Ruslan; Bisk, Yonatan (January 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
A Closer Look at Accuracy vs. Robustness

Yang, Yao-Yuan; Rashtchian, Cyrus; Zhang, Hongyang; Salakhutdinov, Ruslan; Chaudhuri, Kamalika (January 2020, Advances in neural information processing systems)
null (Ed.)
Current methods for training robust networks lead to a drop in test accuracy, which has led prior works to posit that a robustness-accuracy tradeoff may be inevitable in deep learning. We take a closer look at this phenomenon and first show that real image datasets are actually separated. With this property in mind, we then prove that robustness and accuracy should both be achievable for benchmark datasets through locally Lipschitz functions, and hence, there should be no inherent tradeoff between robustness and accuracy. Through extensive experiments with robustness methods, we argue that the gap between theory and practice arises from two limitations of current methods: either they fail to impose local Lipschitzness or they are insufficiently generalized. We explore combining dropout with robust training methods and obtain better generalization. We conclude that achieving robustness and accuracy in practice may require using methods that impose local Lipschitzness and augmenting them with deep learning generalization techniques.
more » « less
Full Text Available
On Emergent Communication in Competitive Multi-Agent Teams

Liang, Paul Pu; Chen, Jeffrey; Salakhutdinov, Ruslan; Morency, Louis-Philippe; Kottur, Satwik (January 2020, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems)

Several recent works have found the emergence of grounded com-positional language in the communication protocols developed bymostly cooperative multi-agent systems when learned end-to-endto maximize performance on a downstream task. However, humanpopulations learn to solve complex tasks involving communicativebehaviors not only in fully cooperative settings but also in scenar-ios where competition acts as an additional external pressure forimprovement. In this work, we investigate whether competitionfor performance from an external, similar agent team could actas a social influence that encourages multi-agent populations todevelop better communication protocols for improved performance,compositionality, and convergence speed. We start fromTask &Talk, a previously proposed referential game between two coopera-tive agents as our testbed and extend it intoTask, Talk & Compete,a game involving two competitive teams each consisting of twoaforementioned cooperative agents. Using this new setting, we pro-vide an empirical study demonstrating the impact of competitiveinfluence on multi-agent teams. Our results show that an externalcompetitive influence leads to improved accuracy and generaliza-tion, as well as faster emergence of communicative languages thatare more informative and compositional.
more » « less
Full Text Available
Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks

Arora, Sanjeev; Du, Simon S.; Li, Zhiyuan; Salakhutdinov, Ruslan; Wang, Ruosong; Yu, Dingli (January 2020, ICLR 2020)

Full Text Available
Towards Debiasing Sentence Representations

https://doi.org/10.18653/v1/2020.acl-main.488

Liang, Paul Pu; Li, Irene Mengze; Zheng, Emily; Lim, Yao Chong; Salakhutdinov, Ruslan; Morency, Louis-Philippe (January 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

As natural language processing methods are increasingly deployed in real-world scenarios such as healthcare, legal systems, and social science, it becomes necessary to recognize the role they potentially play in shaping social biases and stereotypes. Previous work has revealed the presence of social biases in widely used word embeddings involving gender, race, religion, and other social constructs. While some methods were proposed to debias these word-level embeddings, there is a need to perform debiasing at the sentence-level given the recent shift towards new contextualized sentence representations such as ELMo and BERT. In this paper, we investigate the presence of social biases in sentence-level representations and propose a new method, Sent-Debias, to reduce these biases. We show that Sent-Debias is effective in removing biases, and at the same time, preserves performance on sentence-level downstream tasks such as sentiment analysis, linguistic acceptability, and natural language understanding. We hope that our work will inspire future research on characterizing and removing social biases from widely adopted sentence representations for fairer NLP.
more » « less
Full Text Available
Learning Factorized Multimodal Representations

Tsai, Yao-Hung Hubert; Liang, Paul Pu; Zadeh, Amir; Morency, Louis-Philippe; Salakhutdinov, Ruslan (February 2019, International Conference on Representation Learning)

Learning multimodal representations is a fundamentally complex research problem due to the presence of multiple heterogeneous sources of information. Although the presence of multiple modalities provides additional valuable information, there are two key challenges to address when learning from multimodal data: 1) models must learn the complex intra-modal and cross-modal interactions for prediction and 2) models must be robust to unexpected missing or noisy modalities during testing. In this paper, we propose to optimize for a joint generative-discriminative objective across multimodal data and labels. We introduce a model that factorizes representations into two sets of independent factors: multimodal discriminative and modality-specific generative factors. Multimodal discriminative factors are shared across all modalities and contain joint multimodal features required for discriminative tasks such as sentiment prediction. Modality-specific generative factors are unique for each modality and contain the information required for generating data. Experimental results show that our model is able to learn meaningful multimodal representations that achieve state-of-the-art or competitive performance on six multimodal datasets. Our model demonstrates flexible generative capabilities by conditioning on independent factors and can reconstruct missing modalities without significantly impacting performance. Lastly, we interpret our factorized representations to understand the interactions that influence multimodal learning.
more » « less
Full Text Available
External vs. Internal: An Essay on Machine Learning Agents for Autonomous Database Management Systems

Pavlo, Andrew; Butrovich, Matthew; Joshi, Ananya; Ma, Lin; Menon, Prashanth; Van Aken, Dana; Lee, Lisa; Salakhutdinov, Ruslan (June 2019, IEEE bulletin)

Full Text Available
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph

Tsai, Yao-Hung Hubert; Divvala, Santosh; Morency, Louis-Philippe; Salakhutdinov, Ruslan; Farhadi, Ali (January 2019, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition)

Visual relationship reasoning is a crucial yet challenging task for understanding rich interactions across visual concepts. For example, a relationship 'man, open, door' involves a complex relation 'open' between concrete entities 'man, door'. While much of the existing work has studied this problem in the context of still images, understanding visual relationships in videos has received limited attention. Due to their temporal nature, videos enable us to model and reason about a more comprehensive set of visual relationships, such as those requiring multiple (temporal) observations (e.g., 'man, lift up, box' vs. 'man, put down, box'), as well as relationships that are often correlated through time (e.g., 'woman, pay, money' followed by 'woman, buy, coffee'). In this paper, we construct a Conditional Random Field on a fully-connected spatio-temporal graph that exploits the statistical dependency between relational entities spatially and temporally. We introduce a novel gated energy function parametrization that learns adaptive relations conditioned on visual observations. Our model optimization is computationally efficient, and its space computation complexity is significantly amortized through our proposed parameterization. Experimental results on benchmark video datasets (ImageNet Video and Charades) demonstrate state-of-the-art performance across three standard relationship reasoning tasks: Detection, Tagging, and Recognition.
more » « less
Full Text Available
Transformer Dissection: An Unified Understanding for Transformer’s Attention via the Lens of Kernel

https://doi.org/10.18653/v1/D19-1443

Tsai, Yao-Hung Hubert; Bai, Shaojie; Yamada, Makoto; Morency, Louis-Philippe; Salakhutdinov, Ruslan (January 2019, Proceedings of the Conference on Empirical Methods in Natural Language Processing)

Full Text Available

« Prev Next »

Search for: All records